Abstract

A mean-reverting financial instrument is optimally traded by buy- 
ing it when it is sufficiently below the estimated 'mean level' and sell- 
ing it when it is above. In the presence of linear transaction costs, a 
large amount of value is paid away crossing bid-offers unless one de- 
vises a 'buffer' through which the price must move before a trade is 
done. In this paper, Richard Martin and Torsten Schöneborn derive
the optimal strategy and conclude that for low costs the buffer width 
is proportional to the cube root of the transaction cost, determining 
the proportionality constant explicitly. 



Introduction 

A difficult problem in trading algorithm design is linear transaction costs. 
This problem is quite distinct from, and much less analytically tractable 
than, quadratic costs [5], and unless very large positions are being traded it 
is the major source of slippage. 

Traditionally the problem has been considered in the context of trading 
a stock in a portfolio consisting of stock and a risk-free bond (the so-called 
Merton problem) . Loosely, in the Merton problem, to achieve optimal utility 
one needs to maintain a fixed proportion of value in the stock, which ne- 
cessitates continuous trading. Without attention to linear costs, one would 
incur in a time step $dt$ a cost of order $\sqrt{dt}$, so "any [literal] attempt to apply
Merton's strategy in the presence of transaction costs would result in immediate penury" (in Davis & Norman's words [4]). A threshold is therefore
constructed through which the price has to move before rebalancing is done. 
The idea translates into the trading of an arbitrary asset as follows. If its 
present value $X$ is plotted against the current position $\theta$, the $(X, \theta)$ plane
is divided into three zones (Figure 1): a no-trade zone (NT) in the middle,
and on each side, discrete-trade zones (DT) in which the optimal strategy 
is to trade directly to the boundary. A strategy of this form is said to be of 
DT-NT-DT type for obvious reasons. 



Figure 1: Sketch of DT-NT-DT form of optimal trading strategy. In the NT zone, 
no trade is done. Outside, a trade is done to the edge of the boundary, as implied 
by the arrows. The dashed line is the optimal $\theta$ in the costfree case.

The Merton problem has been tackled by several authors. Davis & Nor- 
man [4] examine it probabilistically and derive the optimal buffer shape. 
Shreve & Soner [10] examine the same problem using viscosity solutions of 
PDEs to derive their results. A variant has been tackled by Whalley & 
Wilmott [11] and Zakamouline [12] in the hedging of options under linear
transaction costs. Intriguingly, both sources of problem produce the same 
conclusion to the extent that the width of the NT region is, for small trans- 
action costs, proportional to the cube root of the cost of trading one lot of 
the underlying asset. 

Our encounter with the linear transaction cost problem has been through 
mean-reversion and the trading of putatively stationary combinations of in- 
struments, which has received impetus in recent years through, for example, 
the theory of cointegration (see e.g. [6] for a discussion). The existence
of such stationary combinations is a form of potentially exploitable market 
inefficiency and is discussed by Boguslavsky & Boguslavskaya [2] who do 
not treat transaction costs at all. Whereas in the context of the Merton 
problem, one can circumvent the transaction cost problem simply by rebalancing the portfolio only infrequently [9], in mean-reverting strategies one
is forced to trade frequently, as that is the only way of making money¹. So
the transaction-cost problem has to be fixed rather than side-stepped.

In this paper we make a number of innovations on the classical case 
considered by Davis & Norman. Rather than dealing with 'cash' instru- 
ments, we assume them to be 'synthetic', for example swaps or futures. As 
an example, one might consider a mean-reverting strategy in long-dated vs 
short-dated government bonds. We set this up by taking a combination of 
futures contracts, maintaining a small margin account and permitting con- 
siderable leverage, rather than trading the underlying securities. In writing 
down the dynamics of this 'combined instrument' (1) we have to make some
changes, for several reasons: there will be no risk-free rate in the drift, we
wish to introduce mean reversion in the drift, and the volatility will be permitted to be level-dependent². Although we could give an explicit derivation
for one particular case such as Ornstein-Uhlenbeck (OU), which is a well- 
understood stochastic process, it turns out to be scarcely more analytically 
tractable than the general case, so we deal with that and then give the OU 
result as an immediate consequence. We also require a definition of utility 
that is more suited to trading, being in essence the total discounted expected 
return reduced by terms related to the integrated variation of P&L; in other 
words we are more interested in utility of incremental P&L than of wealth. 
The same approach is, incidentally, adopted by Brandt et al. in a different 
context [3]. For reasons that we explain later, we use constant absolute risk 
aversion. Finally we note that our model requires different boundary condi- 
tions: obviously there cannot be a 'no-shorting' condition, particularly as to 
trade one unit of the combined instrument will probably require being long 
one future and short another. 

The trading of this mean-reverting synthetic asset gives rise to a trans- 
action cost problem, for which we derive the optimal Markovian solution 
of DT-NT-DT form³. Remarkably, the expression for the optimal DT-NT
boundary is in reasonably closed form, being described by the solution of a
pair of coupled nonlinear equations (15, 16).

Next we perform a perturbative analysis for small transaction costs, and
give a simple explicit expression for the approximate width of the NT zone,
finding it to be proportional to the cube root of the transaction cost. This
corroborates the previously-mentioned work in a broader setting.

1 In the absence of 'carry'.

2 Spot-dependent, but not time-dependent, local volatility.

3 We do not prove here that there is no better strategy of some other type:
for example a non-Markovian one in which the optimal trade did not depend simply on
the market value and the current position. The reader is therefore asked to take this on
trust. Incidentally a non-Markovian solution to the problem might well be rather difficult
to implement.

Finally we provide some numerical examples, comparing the optimal 
boundary with the perturbation approximation. Incidentally the optimal 
boundary may also be obtained numerically by the method of dynamic pro- 
gramming, thereby giving an independent verification of the optimal bound- 
ary equations. 

The techniques we use are very straightforward, essentially reducing the 
problem to a linear ordinary differential equation with boundary conditions, 
whose solution we then manipulate; however the algebraic hurdles, particu- 
larly in the perturbation theory, are substantial and have been compressed 
here. 

Notational preliminaries; Costfree solution 

Let Xt be the present value of the aforementioned 'combined instrument' 
that we believe to be mean-reverting. For example, Xt might be a long-short 
combination of long-dated government bonds, or of stocks, or of commodity 
futures. The P&L comes from differences $X_{t+dt} - X_t$, and given that $X_t$ is
the PV of a synthetic instrument it can be negative. We shall assume that 
Xt obeys a time-homogeneous diffusion: 

$$dX_t = \mu(X_t)\,dt + \sigma(X_t)\,dW_t. \tag{1}$$

Let $\theta_t$ be the allocation at time $t$ to $X$. Define the value function

$$V_t = \int_{s=t}^{\infty} e^{-r(s-t)}\,\mathbf{E}_t\big[U(\theta_s\,dX_s)\big]$$

which⁴ measures discounted expected utility of changes in P&L stretching
forward to an infinite horizon. As Xt follows a diffusion, we can simply 
perform the formal expansion 

$$U(\theta_s\,dX_s) = U(0) + \theta_s\,dX_s\,U'(0) + \tfrac12\theta_s^2\,(dX_s)^2\,U''(0) + o(ds),$$

so that we only care about the first and second derivatives of $U$ at the origin.
This is a departure from utility of wealth, in which the whole of the utility
curve will be explored. We stipulate $U(0) = 0$, $U'(0) = 1$, $U''(0) = -1/G$,
so that $G$, which is constant, is a measure of risk appetite and has units of
money (because $U$ does, in our formulation). It is natural to query why $G$
should be constant. The reason is that, in general, a fund will operate many
strategies, with each one allocated a risk budget: the portfolio manager
will specify $G$ for each strategy. Occasionally, $G$ will need to be altered,
depending on the amount of investment or redemption in the fund, on the
fund's overall performance, and on the desired style balance. However, in
between such re-gearing operations each strategy is to run a fixed level of
risk, and this is our setup.

4 This is understood to be an Ito integral, i.e. the increment $dX_s$ is 'after' $\theta_s$, thus:
$\theta_s(X_{s+ds} - X_s)$.

We have 

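$$V_t = \mathbf{E}_t\big[U(\theta_t\,dX_t)\big] + (1 - r\,dt)\,\mathbf{E}_t[V_{t+dt}].$$

Write $V_t = f(X_t)$, as the value function is not explicitly a function of calendar time, and let $dt \to 0$. Expanding the expectation using Ito's lemma (or
the usual Feynman-Kac argument prevalent in option theory) gives

$$-r f(X_t) + \mu(X_t)\frac{\partial f}{\partial X_t} + \tfrac12\sigma(X_t)^2\frac{\partial^2 f}{\partial X_t^2} = -\mathcal{U}\big(X_t, \theta(X_t)\big), \tag{2}$$

where

$$\mathcal{U}(x,\theta) = \frac{1}{dt}\,\mathbf{E}_t\big[U(\theta\,dX_t)\,\big|\,X_t = x\big] = \mu(x)\,\theta - \frac{\sigma(x)^2\theta^2}{2G}$$

is the rate of accumulation of expected utility.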

Write $\theta_t = g(X_t)$. It is intuitively clear that the optimal allocation
to the asset, in the absence of costs, is the value of $\theta$ that maximises the
incremental utility $\mathcal{U}(x, g(x))$, i.e.

$$\theta_t^* = g_0(X_t) = \frac{\mu(X_t)}{\sigma(X_t)^2}\,G. \tag{3}$$

(The 0 in $g_0$ indicates the transaction-free solution, and the * denotes optimality.) This is of the familiar form "expected return ÷ variance × gearing
factor". As $(\partial_2\mathcal{U})(x, g_0(x)) = 0$, we have that the 'rebalancing gamma'
(sensitivity of optimal no-cost position to change in price of instrument) is

$$g_0'(X_t) = -\frac{(\partial_1\partial_2\mathcal{U})(X_t, g_0(X_t))}{(\partial_2^2\,\mathcal{U})(X_t, g_0(X_t))} = \frac{\mu'(X_t) - 2\mu(X_t)\,\sigma'(X_t)/\sigma(X_t)}{\sigma(X_t)^2}\,G, \tag{4}$$

a result that we will need later as the width of the optimal NT zone is linked
to it.
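
As a concrete illustration of (3) and (4), the following minimal Python sketch evaluates the cost-free position and estimates the rebalancing gamma by a central difference. The function names and the OU-type parameter values are assumptions chosen for illustration only; they are not part of the derivation.

```python
def costfree_position(mu, sigma, G, x):
    """Cost-free optimal position g0(x) = mu(x) / sigma(x)^2 * G, as in (3)."""
    return mu(x) / sigma(x) ** 2 * G


def rebalancing_gamma(mu, sigma, G, x, h=1e-5):
    """Central-difference estimate of g0'(x), the quantity given in closed form by (4)."""
    up = costfree_position(mu, sigma, G, x + h)
    down = costfree_position(mu, sigma, G, x - h)
    return (up - down) / (2.0 * h)


# Illustrative (assumed) inputs: mean-reverting drift a - b x, constant volatility.
a, b, vol, G = 0.0, 0.5, 1.0, 2.0
mu = lambda x: a - b * x
sigma = lambda x: vol

theta0 = costfree_position(mu, sigma, G, -1.0)  # long when X is below its mean
gamma0 = rebalancing_gamma(mu, sigma, G, -1.0)  # equals -b G / vol^2 for this drift
```

For a level-dependent volatility one would pass a different `sigma` callable; the finite-difference estimate can then be compared against the closed form in (4).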

We need some further notation. Write

$$\mathcal{L} = \mu(x)\frac{\partial}{\partial x} + \tfrac12\sigma(x)^2\frac{\partial^2}{\partial x^2} \tag{5}$$

for the infinitesimal generator of the diffusion. Then (2) becomes

$$(-r + \mathcal{L})[f](X_t) = -\mathcal{U}(X_t, g(X_t)). \tag{6}$$

Solution of linear ODEs boils down to calculation of the complementary
function (equation with RHS replaced with zero), which we denote $C_\pm(x)$,
and the Green's function (equation with RHS with a delta-function at $x = \xi$,
for any $\xi$), which we denote $K(x,\xi)$:

$$(-r + \mathcal{L})C_\pm = 0; \qquad (-r + \mathcal{L})K(x,\xi) = -\delta(x - \xi). \tag{7}$$

We stipulate (see Appendix) that $C_+$ is positive and monotone increasing,
$C_-$ positive and monotone decreasing. The value function is then given as

$$f(x) = \int_{-\infty}^{\infty} \mathcal{U}(\xi, g(\xi))\,K(x,\xi)\,d\xi. \tag{8}$$

It is easily established that the Green's function is positive (see Appendix).
Therefore, maximising the full integrated utility is indeed achieved by maximising the incremental utility as we did above through the choice (3).

Effect of costs: Main results 
ODE for value function 

We are going to assume that the optimal solution to the linear cost problem 
is of DT-NT-DT type. In the NT zone, no trading occurs so the value 
function $V_t = f(X_t, \theta_t)$ obeys almost the same equation as before, to wit

$$(-r + \mathcal{L})f(x,\theta) = -\mathcal{U}(x,\theta),$$

but note very carefully that $f$ is now a function of two variables (with the
differential operator $\mathcal{L}$ acting on the first one), and that $\theta$ is not adjusted
as $x$ moves: in (8), by contrast, it is variable. The solution can be written
down immediately as "particular solution + complementary function", i.e.

$$f(x,\theta) = \int_{-\infty}^{\infty} \mathcal{U}(\xi,\theta)\,K(x,\xi)\,d\xi + a_+(\theta)\,C_+(x) + a_-(\theta)\,C_-(x), \tag{9}$$

where $a_\pm(\theta)$ are weights to be determined; their values depend on the geometry of the NT boundary because the expression is valid only in the NT
zone.
In the DT zone the position is different. As instantaneous rebalancing is
performed, the market does not have time to move, and the value function
is obtained by deducting the cost of transacting towards the NT boundary.
So:

$$f(x,\theta) = f(x,\theta^\sharp) - \begin{cases} \varepsilon_+(\theta^\sharp - \theta), & \theta < \theta^\sharp \\ \varepsilon_-(\theta - \theta^\sharp), & \theta > \theta^\sharp \end{cases} \tag{10}$$

where $\theta^\sharp$, the target position, is the $\theta$-coordinate of the point on the nearer
NT boundary vertically above or below the current position $(x, \theta)$ (as suggested by the arrows in Figure 1). The parameters $\varepsilon_+, \varepsilon_-$, which have monetary units, are respectively the costs of buying and selling one lot of the asset, and we call them the TCMs (transaction cost multipliers). Importantly,
therefore, the value function in the DT zones is determined immediately
from its value on the boundaries.
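
The DT-zone rule can be sketched in a few lines of Python. This is only an illustration of the bookkeeping: the function name and arguments are hypothetical, and the value on the NT boundary is assumed to have been computed elsewhere.

```python
def dt_zone_value(f_at_target, theta, theta_target, eps_buy, eps_sell):
    """Value in a DT zone: value on the NT boundary minus the cost of trading there.

    f_at_target  -- value function at (x, theta_target), on the NT boundary
    theta        -- current position (outside the NT zone)
    theta_target -- position on the nearer NT boundary at the same market level x
    eps_buy      -- cost of buying one lot; eps_sell -- cost of selling one lot
    """
    if theta < theta_target:
        # Below the NT zone: buy up to the lower boundary.
        cost = eps_buy * (theta_target - theta)
    else:
        # Above the NT zone: sell down to the upper boundary.
        cost = eps_sell * (theta - theta_target)
    return f_at_target - cost


value_after_buy = dt_zone_value(10.0, 1.0, 1.5, eps_buy=0.1, eps_sell=0.2)
value_after_sell = dt_zone_value(10.0, 2.0, 1.5, eps_buy=0.1, eps_sell=0.2)
```

The numbers passed in the two calls are arbitrary; the point is only that the DT-zone value needs nothing beyond the boundary value and the linear trading cost.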



We now need to glue the two solutions (9, 10) together. The value function is already continuous at the boundary by virtue of (10). Imposing
differentiability in the $\theta$-direction (which we justify in the Appendix) gives

$$\begin{aligned}
I(h_+(\theta),\theta) + a_+'(\theta)\,C_+(h_+(\theta)) + a_-'(\theta)\,C_-(h_+(\theta)) &= -\varepsilon_- \\
I(h_-(\theta),\theta) + a_+'(\theta)\,C_+(h_-(\theta)) + a_-'(\theta)\,C_-(h_-(\theta)) &= \varepsilon_+
\end{aligned} \tag{11}$$

Here $h_+(\theta)$ is the value of $x$ satisfying $g_+(x) = \theta$, etc., and $I(x,\theta)$ is defined
as

$$I(x,\theta) = \int_{-\infty}^{\infty} (\partial_2\,\mathcal{U})(\xi,\theta)\,K(x,\xi)\,d\xi,$$

which obeys the ODE⁵

$$(-r + \mathcal{L})I = -\partial_2\,\mathcal{U}. \tag{12}$$

These matching conditions allow $a_\pm(\theta)$ to be determined up to an arbitrary additive constant that can be identified from asymptotic behaviour.
Writing the determinant

$$D(h_+,h_-) = \begin{vmatrix} C_+(h_+) & C_+(h_-) \\ C_-(h_+) & C_-(h_-) \end{vmatrix},$$

and noting that $\lim_{\theta\to+\infty} a_-(\theta) = 0$ and $\lim_{\theta\to-\infty} a_+(\theta) = 0$, we have

$$a_+(\theta) = \int_{-\infty}^{\theta} \frac{1}{D(h_+,h_-)}\Big[-\varepsilon_-\,C_-(h_-(\vartheta)) - \varepsilon_+\,C_-(h_+(\vartheta)) + I(h_-(\vartheta),\vartheta)\,C_-(h_+(\vartheta)) - I(h_+(\vartheta),\vartheta)\,C_-(h_-(\vartheta))\Big]\,d\vartheta, \tag{13}$$

$$a_-(\theta) = \int_{\theta}^{\infty} \frac{1}{D(h_+,h_-)}\Big[-\varepsilon_-\,C_+(h_-(\vartheta)) - \varepsilon_+\,C_+(h_+(\vartheta)) + I(h_-(\vartheta),\vartheta)\,C_+(h_+(\vartheta)) - I(h_+(\vartheta),\vartheta)\,C_+(h_-(\vartheta))\Big]\,d\vartheta, \tag{14}$$

where inside the integrals $D(h_+,h_-)$ is evaluated at $h_\pm = h_\pm(\vartheta)$.

Inserting these into (9) finalises the expression for the value function,
which as far as we are aware is a new result. One sees immediately that if
the NT zone width is shrunk to zero then $D(h_+, h_-) \to 0$ and the integrand
becomes singular ($-\infty$). This confirms that in the absence of a no-trade
zone, infinitely much value is lost through frictional costs, as anticipated.

5 Notation $\partial_2$ means differentiate (once) w.r.t. the 2nd argument, and so on.
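
The divergence of frictional cost without a buffer can be illustrated by simulation. The sketch below is an assumption-laden illustration, not part of the derivation: it takes an OU-type process with hypothetical parameter values, tracks the cost-free position at every time step, and measures the total turnover. The transaction cost paid is the TCM times this turnover, and the turnover grows roughly like $1/\sqrt{dt}$ as the rebalancing interval shrinks.

```python
import numpy as np


def zero_buffer_turnover(dt, horizon=10.0, b=0.5, vol=1.0, G=2.0, seed=0):
    """Total |change in position| when the cost-free position is tracked at every step.

    Simulates dX = -b X dt + vol dW with an Euler scheme (assumed dynamics)
    and rebalances to g0(X) = -b X G / vol^2 after each step, i.e. with no buffer.
    """
    rng = np.random.default_rng(seed)
    n_steps = int(round(horizon / dt))
    shocks = rng.standard_normal(n_steps) * vol * np.sqrt(dt)
    x = np.empty(n_steps + 1)
    x[0] = 0.0
    for i in range(n_steps):
        x[i + 1] = x[i] - b * x[i] * dt + shocks[i]
    theta = -b * x * G / vol ** 2  # cost-free target position at each time
    return float(np.abs(np.diff(theta)).sum())


coarse = zero_buffer_turnover(dt=1e-2)
fine = zero_buffer_turnover(dt=1e-4)
```

Comparing `fine` with `coarse` shows the turnover increasing as `dt` decreases over the same horizon, which is the discrete-time counterpart of the singular integrand noted above.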

Optimal boundary equations 



We now have via (9, 13, 14) the value function for a given boundary, and the
idea now is to optimise the boundary. To do this we need only to maximise
the part of (9) that depends on the boundary location, i.e.

$$a_+(\theta)\,C_+(x) + a_-(\theta)\,C_-(x).$$

The part of this that depends on $h_\pm(\theta)$ is $a_\pm(\theta)$, which in turn are maximised
by maximising the integrands of (13, 14). Differentiating these gives the
positions of the optimal boundaries (denoted $\hat h_\pm$):

$$\begin{aligned}
\frac{\partial a_\pm'}{\partial h_+} \propto\ & \big[C_+'(h_+)\big(\varepsilon_+ C_-(h_+) + \varepsilon_- C_-(h_-)\big) - C_-'(h_+)\big(\varepsilon_+ C_+(h_+) + \varepsilon_- C_+(h_-)\big)\big] \\
& + I(h_+,\theta)\big(C_+'(h_+)\,C_-(h_-) - C_-'(h_+)\,C_+(h_-)\big) \\
& + I(h_-,\theta)\big(C_-'(h_+)\,C_+(h_+) - C_+'(h_+)\,C_-(h_+)\big) \\
& - (\partial_1 I)(h_+,\theta)\,D(h_+,h_-) = 0
\end{aligned} \tag{15}$$

and

$$\begin{aligned}
\frac{\partial a_\pm'}{\partial h_-} \propto\ & \big[C_-'(h_-)\big(\varepsilon_+ C_+(h_+) + \varepsilon_- C_+(h_-)\big) - C_+'(h_-)\big(\varepsilon_+ C_-(h_+) + \varepsilon_- C_-(h_-)\big)\big] \\
& + I(h_+,\theta)\big(C_-'(h_-)\,C_+(h_-) - C_+'(h_-)\,C_-(h_-)\big) \\
& + I(h_-,\theta)\big(C_+'(h_-)\,C_-(h_+) - C_-'(h_-)\,C_+(h_+)\big) \\
& + (\partial_1 I)(h_-,\theta)\,D(h_+,h_-) = 0
\end{aligned} \tag{16}$$

where for clarity we have abbreviated $\hat h_\pm(\theta)$ to $h_\pm$. These appear to be
new results. They are a pair of coupled nonlinear (but not differential)
equations which require the functions $C_\pm$, $C_\pm'$ and $I$ to be coded. For each
$\theta$ their solution gives a pair $(\hat h_-(\theta), \hat h_+(\theta))$, which demarcates the edges of
the optimal NT boundary.

Small transaction costs 

In practice one would prefer not to have to solve the optimal boundary
equations but rely on an approximation based on small $\varepsilon_\pm$. It turns out
that the buying and selling costs only ever occur as their sum (intuitively:
given long enough, the buys and sells cancel out, so the difference in TCM
does not matter), so we write $\varepsilon = \tfrac12(\varepsilon_+ + \varepsilon_-)$.

Two sets of results can be obtained. The first comes from the difference (15) - (16). After much labour this reduces to an expression for the
(horizontal) half-width of the NT zone,

$$\tfrac12\big(h_+(\theta) - h_-(\theta)\big) \sim \left(\frac{3G\varepsilon}{2\,|g_0'(x)|}\right)^{1/3}, \tag{17}$$

where $x$ is the market level at which $g_0(x) = \theta$. The half-width in the vertical direction is obtained by multiplying by $|g_0'(x)|$:

$$\tfrac12\big(g_+(x) - g_-(x)\big) \sim \left(\tfrac32\,G\varepsilon\,g_0'(x)^2\right)^{1/3}. \tag{18}$$



Two things may be seen immediately. The first is the one-third power law
dependence on transaction cost. The second is the appearance of the 'rebalancing gamma' $g_0'(x)$ which we introduced previously and is given explicitly
in terms of the drift and volatility by (4) without the need for solving any
equations. The cube root law agrees with Shreve & Soner's derivation for
the Merton problem⁶. Incidentally the next terms in the expansions are
$O(\varepsilon)$, $O(\varepsilon^{5/3})$ and so on.

The sum (15) + (16) gives our second result, which informs of the horizontal displacement of the mid-point $\tfrac12\big(h_+(\theta) + h_-(\theta)\big)$ from its position in
the absence of costs, $X = h_0(\theta)$ (so $h_0(g_0(x)) = x$). This is found, after
similar effort, to be:

\ (h + (e) + Uo)) - UO) ~ -e ( a^pg ) ' . ( 19 ) 

and the vertical displacement is 

1 (&.(*) + $_(*)) " ~ "A ( ^G^ ) ; (20) 

again, these can be calculated directly from (4).

Another question is, how much value is lost as a result of transaction
costs? To answer this we have to look at the dependence of $f$ on $\varepsilon$. Without
optimising the NT boundary, the dependence of the value function on the
transaction cost is simply $\propto \varepsilon$, as is obvious from (13, 14). More importantly, if we vary the transaction cost while simultaneously adjusting the
NT boundary to keep it optimal, we see from (13, 14) that the numerator
is $O(\varepsilon)$, as just pointed out, but the denominator is $O(h_w) = O(\varepsilon^{1/3})$, where $h_w$ is the half-width of the NT zone, on
account of the $D(h_+, h_-)$ term. Hence the full dependence is $\propto \varepsilon^{2/3}$, which
corroborates Shreve & Soner's deductions [10].



Numerical examples 

The Ornstein-Uhlenbeck process is the simplest model of a mean-reverting 
diffusive process and is specified by (see e.g. [8]) 

$$dX_t = (a - bX_t)\,dt + \sigma\,dW_t, \qquad b > 0.$$

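We can state our conclusions immediately. First $g_0(x) = (a - bx)G/\sigma^2$.
Immediately we can find the half-widths of the NT zone in the horizontal
and vertical directions from (17, 18) as

$$\tfrac12\big(h_+(\theta) - h_-(\theta)\big) \sim \left(\frac{3\sigma^2\varepsilon}{2b}\right)^{1/3}, \qquad \tfrac12\big(g_+(x) - g_-(x)\big) \sim G\left(\frac{3b^2\varepsilon}{2\sigma^4}\right)^{1/3}. \tag{21}$$

6 Stated in their Appendix, [10].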
Figure 2: Boundaries of NT zone (dark blue lines) shown in relation to the
transaction-free optimal allocation (green line) and perturbation approximation
(pink lines). (a,b) are for OU process and (c,d) for extended OU process (22).
Parameter values: (a-d) $\sigma = 1$, $a = 0$, $b = 0.5$, $c = 1$, $r = 0.05$, $G = 2$. Local vol
($\nu$) values: (a,b) 0, (c) 0.125, (d) 0.25. TCM ($\varepsilon$) values: (a) 0.03, (b) 0.24, (c,d)
0.05.

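Using the expression for the displacement as well (20), we obtain the approximate NT boundary as

$$\theta_\pm/G \sim \frac{a - bX}{\sigma^2}\left[1 - \left(\frac{2b\varepsilon^2}{3\sigma^2}\right)^{1/3}\right] \pm \left(\frac{3b^2\varepsilon}{2\sigma^4}\right)^{1/3}.$$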

For the purposes of demonstration we may as well assume $a = 0$, $\sigma^2/2b = 1$
so that the invariant measure of $X$ is standardised to $N(0, 1)$, and that
$G = 2$ so that $d\theta/dX = -1$ in the transaction-free case ($G$ only scales $\theta$ so
its effect is not interesting). The only variables that matter are $\varepsilon$ and $r/b$
(in fact the latter does not even feature in the approximation, and it seems
to have very little effect in practice). To obtain the exact boundary we may
either solve (15, 16), which requires $C_\pm$ to be computed (it is essentially a
parabolic cylinder function in this case: see Appendix) or else use dynamic
programming (also discussed in the Appendix). In the examples tested, the
results were indistinguishable.

We graph the exact solution and this approximation in Figure 2(a,b)
for two different cost levels: $\varepsilon = 0.03$ (which in context is cheap) and 0.24
(expensive). As is apparent by eye from the figures, the optimal buffer width
doubles as the TCM increases 8-fold. Also the agreement between the exact
and approximated solutions is good.

The limit $b \to 0$ (no mean reversion) is interesting. The optimal costfree
position is $aG/\sigma^2$ and the optimal strategy in the presence of transaction
costs is to trade to the edge of the NT zone, specified by $\theta = (a \pm \varepsilon r)G/\sigma^2$
(this can be seen directly), and then hold the position without changing it.
In (21), by contrast, the horizontal width $(3\sigma^2\varepsilon/2b)^{1/3} \to \infty$ and the vertical width
$G(3b^2\varepsilon/2\sigma^4)^{1/3} \to 0$. This is because (21) is only the leading-order term: as there is
no continuous rebalancing, a buffer of width $O(\varepsilon^{1/3})$ is unnecessary, so the
next order term in the expansion, $O(\varepsilon)$, is the pertinent one.
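
The leading-order OU half-widths, horizontal $(3\sigma^2\varepsilon/2b)^{1/3}$ and vertical $G(3b^2\varepsilon/2\sigma^4)^{1/3}$, are simple enough to tabulate directly. The following Python sketch (function and variable names are illustrative assumptions) evaluates them for the two cost levels used in the examples and for a very slow mean reversion.

```python
def ou_half_widths(eps, b, vol, G):
    """Leading-order NT half-widths for the OU model: (horizontal, vertical)."""
    horizontal = (3.0 * vol ** 2 * eps / (2.0 * b)) ** (1.0 / 3.0)
    vertical = G * (3.0 * b ** 2 * eps / (2.0 * vol ** 4)) ** (1.0 / 3.0)
    return horizontal, vertical


# Cost levels from the numerical examples, with vol^2 / (2 b) = 1 and G = 2.
cheap = ou_half_widths(eps=0.03, b=0.5, vol=1.0, G=2.0)
expensive = ou_half_widths(eps=0.24, b=0.5, vol=1.0, G=2.0)

# Very slow mean reversion: horizontal width blows up, vertical width collapses.
slow = ou_half_widths(eps=0.03, b=1e-6, vol=1.0, G=2.0)
```

An 8-fold increase in the TCM doubles both half-widths, which is the cube-root law in numerical form; the `slow` case shows why the leading-order term alone is uninformative as $b \to 0$.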

A sort of 'local volatility' extension to this model is given by

$$dX_t = -bX_t\,dt + \sigma\,(1 + c^2X_t^2)^{\nu}\,dW_t. \tag{22}$$

The equilibrium point is fixed at zero and for $\nu > 0$ the volatility increases
away from it. The no-cost line is no longer straight, as the increased volatility
away from equilibrium makes larger positions unattractive. Figure 2(c,d)
shows the results for this model, given by

$$\theta_\pm/G \sim \frac{-bX}{\sigma^2(1 + c^2X^2)^{2\nu}} + \frac{bX}{\sigma^2}\left(\frac{2b\varepsilon^2}{3\sigma^2}\right)^{1/3}\frac{\big(1 - 4\nu c^2X^2(1 + c^2X^2)^{-1}\big)^{1/3}}{(1 + c^2X^2)^{8\nu/3}} \pm \left(\frac{3b^2\varepsilon}{2\sigma^4}\right)^{1/3}\frac{\big(1 - 4\nu c^2X^2(1 + c^2X^2)^{-1}\big)^{2/3}}{(1 + c^2X^2)^{4\nu/3}}$$

and again there is reasonable agreement between the perturbation expansion
and the optimal boundary. Notice that the buffer width, in the vertical sense,
is thinner at the edges ($|X|$ large) than in the middle, as is seen from (18).
Conclusions



We have shown, in the case of a mean-reverting diffusion process, how to 
find the optimal solution of 'DT-NT-DT' type and derived leading-order 
sensitivities to transaction cost. In common with the Merton problem we 
find that the optimal NT zone width is proportional to the cube root of the 
transaction cost. The results have been verified for the OU process and for 
an extension of it. An obvious extension is the multivariate case in which 
the dynamics of Xt are driven by several extra factors; we are working on 
this. 

An interesting phenomenon occurs when, in the extended-OU model
(22), the parameter $\nu$ exceeds $\tfrac14$. Then, the relationship between market
level $X$ and optimal costfree position $g_0(x)$ is no longer one-to-one, and
so the relation $X = h(\theta)$, heavily exploited in the present analysis, is not
well-defined: there is a particular problem at $X = \pm 1/(c\sqrt{4\nu - 1})$, where
$g_0'(X) = 0$. Referring back to Figure 1, we have solved for the value function
in the NT zone in horizontal slices $X \in [h_-(\theta), h_+(\theta)]$, but maybe a better
approach would be to use vertical ones. However, we have not been able to
do this, so it is a matter for further investigation.
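
The loss of monotonicity is easy to check numerically. The sketch below (names and parameter values are illustrative assumptions) evaluates the cost-free position $g_0(X) = -bXG/\big(\sigma^2(1 + c^2X^2)^{2\nu}\big)$ for the extended-OU model and confirms that its slope vanishes at $X = 1/(c\sqrt{4\nu - 1})$ once $\nu > \tfrac14$.

```python
import math


def g0_extended(x, b, vol, c, nu, G):
    """Cost-free position for dX = -b X dt + vol (1 + c^2 X^2)^nu dW."""
    return -b * x * G / (vol ** 2 * (1.0 + c * c * x * x) ** (2.0 * nu))


def turning_point(c, nu):
    """Positive X at which g0'(X) = 0, or None when nu <= 1/4 (g0 is monotone)."""
    if nu <= 0.25:
        return None
    return 1.0 / (c * math.sqrt(4.0 * nu - 1.0))


# Illustrative values with nu above the critical level 1/4.
b, vol, c, nu, G = 0.5, 1.0, 1.0, 0.5, 2.0
x_star = turning_point(c, nu)

# Central-difference slope of g0 at the turning point; should be close to zero.
h = 1e-5
slope_at_turn = (g0_extended(x_star + h, b, vol, c, nu, G)
                 - g0_extended(x_star - h, b, vol, c, nu, G)) / (2.0 * h)
```

Beyond `x_star` the cost-free position shrinks in magnitude as $|X|$ grows, so a given position $\theta$ corresponds to two market levels and the inverse map $h(\theta)$ is no longer single-valued.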

Richard Martin and Torsten Schöneborn are with AHL, part of Man
Group PLC. The views expressed in this paper are their own rather than
those of their institution. They thank Gunnar Klinkhammer and Thaleia
Zariphopoulou for numerous helpful discussions and the referees for their
suggestions for improvement. A longer version is available on request. Email
rmartin@ahl.com, tschoeneborn@ahl.com
Appendix

Note on the complementary functions C±(x) 

The existence of positive solutions to the equation $(-r + \mathcal{L})f = 0$, one monotone increasing ($C_+$) and the other monotone decreasing ($C_-$), follows from
standard theorems in ODE theory. Write $\psi(x) = f'(x)/f(x)$, a standard
gambit, to obtain the Riccati equation

$$-\tfrac12\sigma(x)^2\,\psi'(x) = \tfrac12\sigma(x)^2\,\psi(x)^2 + \mu(x)\,\psi(x) - r.$$

We claim that there exists a positive solution $\psi_+(x)$ and a negative solution
$\psi_-(x)$ to this equation satisfying $\psi_+(-\infty) = \psi_-(+\infty) = 0$. This will prove
the required statements about $C_\pm = e^{\int\psi_\pm}$. The RHS is a quadratic in $\psi$,
and factorises as

$$\psi'(x) = -\big(\psi(x) - \Psi_+(x)\big)\big(\psi(x) - \Psi_-(x)\big) \tag{23}$$

say, with $\Psi_+(x) > 0 > \Psi_-(x)$ (that the roots have opposite sign follows
immediately from $r > 0$). We only deal with $\psi_+$, as $\psi_-$ is analogous. By
the Cauchy-Lipschitz theorem [7] there exists a unique positive solution to
the Riccati equation satisfying $\psi_+(-\infty) = 0$. If $0 < \psi_+(x) < \Psi_+(x)$ then
RHS (23) $> 0$ so $\psi_+(x)$ remains positive as $x$ increases; if $\psi_+(x) > \Psi_+(x)$
then it is still positive.

Note on the Green's function $K(x, \xi)$

For $x < \xi$ and for $x > \xi$ the Green's function is just a solution to the
homogeneous ODE, as the RHS of the differential equation $(-r + \mathcal{L})f = -\delta(x - \xi)$ is zero, but it is a different solution on each side⁷. By integrating
over a small segment at $x = \xi$, one finds that $K(x, \xi)$ is continuous in $x$ but
its $x$-derivative jumps by $-2/\sigma(\xi)^2$. Putting this together, we obtain

$$K(x,\xi) = \left\{\begin{array}{ll} C_+(x)\,C_-(\xi), & x < \xi \\ C_+(\xi)\,C_-(x), & x > \xi \end{array}\right\} \times \frac{2}{\sigma(\xi)^2\,W\{C_-,C_+\}(\xi)},$$

with $W\{f,g\} = fg' - gf'$ denoting the Wronskian. Positivity of $K$, as
asserted previously, then follows immediately.
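
A simple sanity check of this construction is the constant-coefficient case $\mu = 0$, $\sigma$ constant, which is an assumption made purely for illustration. There $C_\pm(x) = e^{\pm kx}$ with $k = \sqrt{2r}/\sigma$, the Wronskian $W\{C_-,C_+\} = 2k$, and the Green's function reduces to $e^{-k|x-\xi|}/(\sigma\sqrt{2r})$. Its integral over $\xi$ should equal $1/r$, the present value of a unit rate of utility accumulated forever. The Python sketch below verifies this numerically.

```python
import math


def green_constant_coeff(x, xi, r, vol):
    """K(x, xi) for mu = 0 and constant volatility, built from C+/- = exp(+/- k x)."""
    k = math.sqrt(2.0 * r) / vol
    lo, hi = min(x, xi), max(x, xi)
    wronskian = 2.0 * k  # C_- C_+' - C_+ C_-' for these complementary functions
    # C_+ is evaluated at the smaller argument and C_- at the larger one.
    return math.exp(k * lo) * math.exp(-k * hi) * 2.0 / (vol ** 2 * wronskian)


def integrate_green(x, r, vol, half_range=200.0, n=400_001):
    """Trapezoidal integral of K(x, xi) over xi in [x - half_range, x + half_range]."""
    step = 2.0 * half_range / (n - 1)
    total = 0.0
    for i in range(n):
        xi = x - half_range + i * step
        weight = 0.5 if i in (0, n - 1) else 1.0
        total += weight * green_constant_coeff(x, xi, r, vol)
    return total * step


r, vol = 0.05, 1.0
total_weight = integrate_green(x=0.3, r=r, vol=vol)  # should be close to 1 / r
```

The integration range and grid size are arbitrary choices; they only need to be wide and fine enough for the exponentially decaying tails and the kink at $\xi = x$ to be resolved.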

There is one issue that we have swept under the carpet: in the above 
construction we have implicitly dealt with the boundary conditions, by as- 
suming that the Green's function decays at ±oo. The assumption is that 
the value function obeys the regularity conditions 

$$\lim_{x\to+\infty} f(x)/C_+(x) = 0; \qquad \lim_{x\to-\infty} f(x)/C_-(x) = 0.$$

This is almost certainly true, but ought to be proven. Probably, it follows 
from a simple bounding argument on the value function. 

Value function with costs; Dynamic programming 

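Let $\theta_t^-$ denote the position before rebalancing at time $t$. Then $\theta_t = \theta_{t+dt}^-$
and both are 'known at time $t$'. If we now have $V_t = f(X_t, \theta_t^-)$, then

$$f(X_t,\theta_t^-) = (1 - r\,dt)\,\mathbf{E}_t\big[f(X_{t+dt},\theta_t)\big] + \mathcal{U}(X_t,\theta_t)\,dt - \begin{cases} \varepsilon_-\,|\theta_t - \theta_t^-|, & \theta_t < \theta_t^- \\ \varepsilon_+\,|\theta_t - \theta_t^-|, & \theta_t > \theta_t^- \end{cases}$$

We take it as read that $\theta_t$ is a function $g$ of $X_t$ and $\theta_t^-$ only, i.e. a Markovian
rebalancing strategy, and so we have

$$f(x,\theta) = (1 - r\,dt)\,\mathbf{E}_t\big[f(X_{t+dt}, g(X_t,\theta))\,\big|\,X_t = x\big] + \mathcal{U}(x, g(x,\theta))\,dt - \begin{cases} \varepsilon_-\,|g(x,\theta) - \theta|, & g(x,\theta) < \theta \\ \varepsilon_+\,|g(x,\theta) - \theta|, & g(x,\theta) > \theta \end{cases} \tag{24}$$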



7 Currently the most accessible discussion is probably the Wikipedia entry for "Green's 
function" . 
A numerical method for solving (24) is dynamic programming. First,
set up a grid $(X, \theta)$, and start with some approximation to $f$ such as $f = 0$.
Then perform the following iteration, which effectively is one time step of
size $dt$, until convergence occurs (in $f$ and $g$):

• At each gridpoint $(X, \theta)$, proceed as follows:

- Calculate for all possible rebalanced positions $g(X, \theta)$ on the grid
the expectation term (approximately, by observing that at a short
horizon $X_{t+dt}$ is roughly Normally distributed with mean $X_t + \mu(X_t)\,dt$
and variance $\sigma(X_t)^2\,dt$). Also calculate the other two terms on the
RHS of (24).

- Record which choice of $g(X,\theta)$ optimises the RHS of (24).

• Repeat for all other gridpoints.
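
The iteration above can be sketched in Python with NumPy as follows. This is a minimal illustration under stated assumptions, not the implementation used for the figures: it takes OU dynamics with hypothetical default parameters, uses coarse grids, approximates the one-step transition by Normal weights (mean $x + \mu(x)\,dt$, variance $\sigma^2 dt$) renormalised on the grid, and runs a fixed number of sweeps rather than testing for convergence.

```python
import numpy as np


def solve_dp(eps_buy=0.05, eps_sell=0.05, b=0.5, vol=1.0, G=2.0, r=0.05,
             dt=0.1, sweeps=300):
    """Value iteration for (24) on a grid, assuming dX = -b X dt + vol dW."""
    xs = np.linspace(-4.0, 4.0, 81)      # market level grid
    thetas = np.linspace(-4.0, 4.0, 41)  # position grid
    # One-step transition weights, renormalised so each row sums to one.
    mean = xs - b * xs * dt
    P = np.exp(-(xs[None, :] - mean[:, None]) ** 2 / (2.0 * vol ** 2 * dt))
    P /= P.sum(axis=1, keepdims=True)
    # Rate of utility accumulation: mu(x) theta - vol^2 theta^2 / (2 G).
    util = (-b * xs)[:, None] * thetas[None, :] \
        - vol ** 2 * thetas[None, :] ** 2 / (2.0 * G)
    # cost[j, k]: cost of moving the position from thetas[j] to thetas[k].
    move = thetas[None, :] - thetas[:, None]
    cost = np.where(move > 0, eps_buy * move, -eps_sell * move)
    f = np.zeros((xs.size, thetas.size))
    policy = np.zeros((xs.size, thetas.size), dtype=int)
    for _ in range(sweeps):
        # Value of holding each post-trade position over one step.
        hold = util * dt + (1.0 - r * dt) * (P @ f)
        # candidates[i, j, k]: at xs[i], trade from thetas[j] to thetas[k].
        candidates = hold[:, None, :] - cost[None, :, :]
        policy = candidates.argmax(axis=2)
        f = candidates.max(axis=2)
    return xs, thetas, f, policy


xs, thetas, f, policy = solve_dp()
i = int(np.argmin(np.abs(xs + 1.0)))   # grid point nearest X = -1
lower = thetas[policy[i, 0]]           # target when starting far too short
upper = thetas[policy[i, -1]]          # target when starting far too long
```

Reading off `lower` and `upper` at each market level traces the two edges of the NT zone. Finer grids, a smaller `dt` and a proper convergence test would be needed before comparing quantitatively with the boundary equations.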

Convergence in $g$ occurs much more quickly than in $f$; indeed for zero transaction cost the convergence in $g$ occurs in one time step. The above iteration
scheme defines a map $f \mapsto Tf$, say, that improves the approximation of $f$ to
the optimal value function. That convergence occurs in $f$ follows from the
fact that $T$ is a contraction mapping, essentially because the interest rate is
positive, so errors are slowly discounted. Notice that we have described the
method for finding the value function and the optimal trading strategy, because we are maximising over $g$, but we do not have to do that: for example
we might want to know about how much value is lost by deliberately using
a specified (suboptimal) strategy. That calculation is faster of course as no
search over $g$-values is required.

We recommend this method for finding g (which is usually more im- 
portant than /), mainly as a useful check on the analytical methods and 
approximations that we are about to derive. 

Note on differentiability at the boundary 

Referring to Figure 3, which shows part of the upper NT-DT boundary, we
wish to compare the value function at A and at D, and claim that for an
infinitesimal box ABCD, $V_A - V_D = \varepsilon_-(\theta_D - \theta_A)$ to leading order. Consider
the difference between being at A and being at D, over the next time step
$dt$. If $X$ falls in value (A → F or D → E), no trading is done, so the
only difference is through the P&L which is slightly less for A by an amount
$|dX_t\,d\theta_t| = O(dt)$ (as one has a longer position at A than at D). On the other
hand if $X$ rises, in addition to the P&L difference there is no transaction
cost from moving D → C as this is still in the NT zone, but there is a cost
of $\varepsilon_-|d\theta_t| = O(\sqrt{dt})$ in moving A → B → C as one has ended up in the
DT zone. On the other hand, $V_A - V_D = (\partial f/\partial\theta)\,d\theta_t = O(\sqrt{dt})$, where in
evaluating this one naturally uses the expression for $f$ inside the NT zone.
Equating terms in $O(\sqrt{dt})$ gives $\partial f/\partial\theta = -\varepsilon_-$, i.e. (11). A similar argument
holds for the lower boundary.



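[Figure 3 sketch: box with corners F, A, B (upper row) and E, D, C (lower row), horizontal increment $dX_t$, boundary $\theta = g_+(X)$ separating the DT zone (above) from the NT zone (below).]

Figure 3: Sketch for 'box argument' explaining why the value function is $\theta$-differentiable at the boundary.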



Note on the optimal boundary equations 

We refer to (9). As the solution obtained by maximising

$$a_+(\theta)\,C_+(x) + a_-(\theta)\,C_-(x)$$

is to work for all $x$, we expect optimisation of $a_+(\theta)$ and $a_-(\theta)$ w.r.t.
$h_\pm(\theta)$ to give the same answer; furthermore, this answer should not depend on $\theta$ either. As $C_\pm(x) > 0$, all we have to do is maximise $a_\pm$. That
four equations reduce to two follows from the identities

$$\frac{\partial a_+'/\partial h_-}{\partial a_-'/\partial h_-} = -\frac{C_-(h_+)}{C_+(h_+)}; \qquad \frac{\partial a_+'/\partial h_+}{\partial a_-'/\partial h_+} = -\frac{C_-(h_-)}{C_+(h_-)}.$$

It is necessary for such a reduction to occur, since otherwise we would have
four equations for two unknowns, resulting in a solution that did not work
for all values of $x$.

Basic properties of the solution (for value function) 

As pointed out in the text, if $\varepsilon_\pm > 0$ and the NT zone width is shrunk to zero, then $a_\pm(\theta) \to -\infty$ and the value
function tends to $-\infty$. So the optimal buffer width is certainly never zero.
What is less clear is what happens for very large transaction costs. There 
are conceivably two possibilities: (A) the value function increases sufficiently 
rapidly with market dislocation to absorb the transaction cost, and make it 
worth trading the asset when it is sufficiently far from equilibrium, even in 
very expensive markets; or (B) above some critical level of transaction cost, 
it is not worth trading. In the OU case, it is (A) that holds, and we suspect 
that this is generally true. 

Next, we should be able to show the rather obvious result that in the 
absence of transaction costs, a buffer of positive width is suboptimal, but 
the suboptimality vanishes as the width is shrunk to zero. This is not as 
easy to prove as it may seem, at least by the methods we have derived here, 
so the following argument can possibly be improved. The no transaction 
cost solution as previously derived is 



The solution we have derived for a buffered strategy is, on the other hand, 
on the line 9 = go(x), 



f(x,g (x))= / K(x,£)U(€,g (x)) d£+a + (g (x))C + (x)+a-(g (x))C-(x) 



(note very carefully that the integrand specifies g$ (x), whereas in Jntc it 
is 5o(£))> where a± have now been found explicitly, and simplifiable given 
that e = 0. We are to show that / — >• /ntc, from below, as the buffer width 
contracts to zero. We put the buffer along the optimal NTC strategy and 
allow the width to contract to 0. 

By the boundary conditions, of which we write [BC] for the LHS, we
have

$$ I(x, g_0(x)) + a_+'(g_0(x))\, C_+(x) + a_-'(g_0(x))\, C_-(x) = 0 \quad \forall x, $$

which can also be differentiated (we will use that presently, but not write it
out here).

Notice first that

$$ (-r + \mathcal{L}) f_{\rm NTC}(x) = -U(x, g_0(x)) $$

whereas

$$
\begin{aligned}
(-r + \mathcal{L}_x) f(x, g_0(x)) = {} & -U(x, g_0(x)) \\
& + a_+(g_0(x))\,(-r + \mathcal{L}) C_+(x) + a_-(g_0(x))\,(-r + \mathcal{L}) C_-(x) \\
& + \left[ \mu(x) g_0'(x) + \tfrac12 \sigma(x)^2 g_0''(x) \right] [\mathrm{BC}] \\
& + \tfrac12 \sigma^2(x)\, g_0'(x)^2\, \frac{\partial}{\partial\theta} [\mathrm{BC}] \\
& + \sigma^2(x)\, g_0'(x) \left[ (\partial_1 I)(x, g_0(x)) + a_+'(g_0(x))\, C_+'(x) + a_-'(g_0(x))\, C_-'(x) \right]
\end{aligned}
$$

where the notation $\mathcal{L}_x$ emphasises that all the x-dependence is being
differentiated, i.e. that through the first argument of $f$ and that through the
second argument via $g_0$. (This is what makes the algebra fairly messy, as
differentiating, for example, $a(g_0(x))C(x)$ twice w.r.t. x produces four terms.)
On the RHS of this expression, the second, third and fourth lines are zero.
To understand the last one, we take the limit $h_\pm(\theta) \to h_0(\theta)$, i.e. the
NTC-optimal market value that corresponds to $\theta$, to obtain

$$ a_+'(\theta) = \frac{-C_-(h_0(\theta))\,(\partial_1 I)(h_0(\theta), \theta) + C_-'(h_0(\theta))\, I(h_0(\theta), \theta)}{W\{C_-, C_+\}(h_0(\theta))}, $$

$$ a_-'(\theta) = \frac{C_+(h_0(\theta))\,(\partial_1 I)(h_0(\theta), \theta) - C_+'(h_0(\theta))\, I(h_0(\theta), \theta)}{W\{C_-, C_+\}(h_0(\theta))}. $$

But now when $\theta = g_0(x)$, or equivalently $x = h_0(\theta)$, we have

$$ a_+'(g_0(x))\, C_+'(x) + a_-'(g_0(x))\, C_-'(x) = -(\partial_1 I)(x, g_0(x)), $$

so the fifth line also vanishes. Hence the two value functions $f$ and $f_{\rm NTC}$,
for an NT zone of zero width, and in the absence of transaction costs, differ
by some element of $\ker[-r + \mathcal{L}]$, which we argue must be zero because of
the behaviour as $x \to \pm\infty$.

An alternative route to this conclusion (which is messier, but more direct)
is simply to express the Green's function in terms of the complementary
functions, and hack out all the various integrals.

Limit of small transaction costs

We study the solution for small $\epsilon$ and in particular wish to study the width
of the NT zone, as well as whether the NT zone is displaced relative to the
cost-free strategy $\theta = g_0(X)$. By width we can either mean in the horizontal
direction ('X-width') or vertical direction ('$\theta$-width'). We develop (15, 16)
around the point ($\epsilon_- = \epsilon_+ = 0$, $h_+ - h_- = 0$). It is convenient to write

$$ h_w(\theta) = \tfrac12 \big( h_+(\theta) - h_-(\theta) \big); \qquad h_m(\theta) = \tfrac12 \big( h_+(\theta) + h_-(\theta) \big), $$

with w for half-width and m for mid-point. Also, as the mid-point is near $h_0$
(where $h_0(g_0(x)) = x$), we can split out the displacement of the mid-point
from the transaction-free case as

$$ h_m = h_0 + h_d, $$

with $h_d$ (d for displacement) small.

It is useful to recall some results from differential algebra. Define first

$$ W_{i,j} = C_+^{(i)} C_-^{(j)} - C_-^{(i)} C_+^{(j)}, $$

where superscripts denote derivatives; this is a function of x (and when
necessary it will be evaluated at $x = h_m$). Notice that $W_{1,0} = W\{C_-, C_+\}$,
the Wronskian. An important result following directly from (5, 7) is

$$ \frac{W_{2,1}}{W_{1,0}} = -\frac{r}{\tfrac12 \sigma(x)^2}, \qquad \frac{W_{2,0}}{W_{1,0}} = -\frac{\mu(x)}{\tfrac12 \sigma(x)^2} \tag{25} $$

(a principle which is expanded upon below) and so, in operator notation,

$$ \tfrac12 \sigma(x)^2 \left( W_{2,1} - W_{2,0} \frac{d}{dx} + W_{1,0} \frac{d^2}{dx^2} \right) = W_{1,0} \cdot (-r + \mathcal{L}). $$

Furthermore, by differentiating again,

$$ \tfrac12 \sigma(x)^2 \left( W_{3,1} - W_{3,0} \frac{d}{dx} + W_{1,0} \frac{d^3}{dx^3} \right) = W_{1,0} \left( \frac{d}{dx} - \frac{\mu(x) + \sigma'(x)\sigma(x)}{\tfrac12 \sigma(x)^2} \right) (-r + \mathcal{L}). $$

This is important, as all the unwieldy expressions containing terms such as
$C_+'''(x) C_-(x) - C_+(x) C_-'''(x)$, which arise in the Taylor series expansion of
(15, 16), simplify to elementary functions of the coefficients of the underlying
ODE, and hence to $\mu(x)$, $\sigma(x)$, $r$, and their derivatives, which are known
immediately.

To make a start on the analysis we note first that

$$ D(h_+, h_-) = 2 h_w W_{1,0} + O(h_w^3) $$

(by symmetry arguments there is no $O(h_w^2)$ term). The next important point
is that when we develop all the terms of (15, 16) in the vicinity of $h_m$, the
$O(h_w)$ term in the $I$-terms (i.e. the terms without $\epsilon$ on the front) vanishes.
Recall $\epsilon := \tfrac12 (\epsilon_- + \epsilon_+)$.



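Difference (15) - (16). The symmetry of the non-$\epsilon$ terms is odd, so we
are interested in the cubic term. The expansion is

$$ 4 \epsilon W_{1,0} - \tfrac43 \left( W_{3,1} - W_{3,0}\, \partial_1 + W_{1,0}\, \partial_1^3 \right) I(h_m, \theta) \cdot h_w^3 = O(\epsilon h_w^2, h_w^5). $$

Recalling that $I(x, \theta)$ obeys (12), we can simplify the expression using this
and (25) and the fact that $(\partial_2 U)(h_0(\theta), \theta) = 0$, to obtain

$$ 4 \epsilon W_{1,0} + \frac{8 h_w^3}{3 \sigma^2(h_0)}\, W_{1,0}\, (\partial_1 \partial_2 U)(h_0, \theta) = O(\epsilon h_w^2, h_w^5). $$

Using (1) we can simplify the second term on the LHS, and obtain the
expression for the half-width $h_w$ quoted in the main text.

Sum (15) + (16). The symmetry of the non-$\epsilon$ terms is even, and so we have

$$ 4 \epsilon h_w W_{2,0} - 4 \left( W_{2,1} - W_{2,0}\, \partial_1 + W_{1,0}\, \partial_1^2 \right) I(h_m, \theta) \cdot h_w^2 = O(\epsilon h_w^2, h_w^4). $$

The second term on the LHS emerges (again applying (25, 12)) as

$$ \frac{4 h_w^2}{\tfrac12 \sigma^2(h_m)}\, W_{1,0}\, (\partial_2 U)(h_m, \theta), $$

so we obtain

$$ \mu(h_m)\, \epsilon \sim (\partial_2 U)(h_m, \theta)\, h_w. $$

Again we use $(\partial_2 U)(h_0(\theta), \theta) = 0$ to obtain

$$ \mu(h_0)\, \epsilon \sim (\partial_1 \partial_2 U)(h_0, \theta)\, h_w h_d, $$

and we end up with (19).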
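As a rough numerical illustration of the resulting scalings, the sketch below assumes the two leading-order balances read $4\epsilon + 8 h_w^3 (\partial_1\partial_2 U)/(3\sigma^2) \approx 0$ and $\mu\epsilon \approx (\partial_1\partial_2 U)\, h_w h_d$ (the factor $W_{1,0}$ cancels). The function names and all parameter values are hypothetical, and taking $\partial_1\partial_2 U < 0$ so that the width is positive is an assumption about sign conventions, not something fixed by the text above.

```python
def buffer_half_width(eps, sigma, d12U):
    """Leading-order half-width h_w solving 4*eps + 8*h_w**3*d12U/(3*sigma**2) = 0.

    eps   : transaction cost (assumed > 0)
    sigma : volatility evaluated at h_0 (assumed > 0)
    d12U  : mixed derivative d1 d2 U at (h_0, theta), assumed negative here
    """
    if eps <= 0 or sigma <= 0 or d12U >= 0:
        raise ValueError("sketch assumes eps > 0, sigma > 0 and d12U < 0")
    return (-1.5 * sigma ** 2 * eps / d12U) ** (1.0 / 3.0)


def midpoint_displacement(eps, mu, d12U, h_w):
    """Leading-order displacement h_d from mu*eps ~ d12U*h_w*h_d."""
    return mu * eps / (d12U * h_w)


# Hypothetical inputs, chosen only to show the scaling with eps.
sigma, d12U, mu = 0.2, -0.5, 0.03
costs = (1e-4, 8e-4)  # second cost is 8x the first

widths = [buffer_half_width(e, sigma, d12U) for e in costs]
shifts = [midpoint_displacement(e, mu, d12U, w) for e, w in zip(costs, widths)]

width_ratio = widths[1] / widths[0]  # cube-root law: 8x cost doubles the width
shift_ratio = shifts[1] / shifts[0]  # displacement scales like eps**(2/3)
print(width_ratio, shift_ratio)
```

Under these assumptions an eightfold increase in cost doubles the half-width and quadruples the mid-point displacement, so the displacement is of higher order in $\epsilon$ than the width.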



Notes on invariants of ODEs 

If

$$ \frac{d^2 y}{dx^2} + p(x) \frac{dy}{dx} + q(x)\, y = 0 $$

then, defining (for two independent solutions $y_1$, $y_2$)

$$ W_{i,j} = y_1^{(i)} y_2^{(j)} - y_2^{(i)} y_1^{(j)}, $$

we have

$$ \frac{W_{2,0}}{W_{1,0}} = -p; \qquad \frac{W_{2,1}}{W_{1,0}} = q; \qquad \frac{W_{3,0}}{W_{1,0}} = p^2 - p' - q; \qquad \frac{W_{3,1}}{W_{1,0}} = q' - pq. $$

Note that $W_{1,0} = -W\{y_1, y_2\}$ with W denoting the Wronskian. The first
two results are somewhat analogous to the sum and product formulas for
the roots of a quadratic. The analogy is made obvious when one considers
$y_i(x) = e^{\lambda_i x}$, i = 1, 2, in which case $-p$ and $q$ are constants and equal to the
sum and the product of the $(\lambda_i)$.

More generally, one can recognise that $W_{i,j}/W_{k,l}$ is always a rational
function of the coefficients of the ODE (p, q) and their derivatives. This is
because it is invariant under any transformation of the form $y_1 \mapsto a y_1 + b y_2$,
$y_2 \mapsto c y_1 + d y_2$ with $ad - bc \neq 0$, i.e. the action of the matrix group $GL_2(\mathbb{C})$.
Hence it is also invariant under the action of the differential Galois group
(footnote 3) of the differential field extension
$\mathbb{C}(x, p, q, y_1, y_2)/\mathbb{C}(x, p, q)$ (this group being
a subgroup of $GL_2(\mathbb{C})$), and therefore must be contained in $\mathbb{C}(x, p, q)$. In
the same way, in the case of a quadratic equation $y^2 + py + q = 0$ with
roots $y_{1,2}$, expressions that are symmetric under $y_1 \leftrightarrow y_2$, such as $y_1^3 + y_2^3$,
are expressible as rational functions of p and q, whereas nonsymmetric ones,
such as $y_1 + 2 y_2$, are not.
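The constant-coefficient case gives a quick sanity check of these invariants. The sketch below assumes $y_i(x) = e^{\lambda_i x}$, so that $p = -(\lambda_1 + \lambda_2)$, $q = \lambda_1 \lambda_2$ and $p' = q' = 0$, and it also checks that the ratios are unchanged when the two solutions are mixed by an invertible matrix. The helper names and the numerical values are hypothetical.

```python
import math


def deriv(coeffs, lams, k, x):
    """k-th derivative at x of sum_n coeffs[n] * exp(lams[n] * x)."""
    return sum(c * lam ** k * math.exp(lam * x) for c, lam in zip(coeffs, lams))


def w(i, j, u, v, lams, x):
    """W_{i,j} = u^(i) v^(j) - v^(i) u^(j) for two solutions u, v."""
    return (deriv(u, lams, i, x) * deriv(v, lams, j, x)
            - deriv(v, lams, i, x) * deriv(u, lams, j, x))


def invariant_ratios(u, v, lams, x):
    """Ratios W_{2,0}, W_{2,1}, W_{3,0}, W_{3,1} over W_{1,0}."""
    w10 = w(1, 0, u, v, lams, x)
    return [w(i, j, u, v, lams, x) / w10 for i, j in ((2, 0), (2, 1), (3, 0), (3, 1))]


lams = (0.7, -1.3)                      # hypothetical exponents
p, q = -(lams[0] + lams[1]), lams[0] * lams[1]
expected = [-p, q, p * p - q, -p * q]   # p' = q' = 0 for constant coefficients

# Solutions are stored as coefficient pairs on the basis (e^{lam1 x}, e^{lam2 x}).
basic = invariant_ratios((1.0, 0.0), (0.0, 1.0), lams, 0.4)
mixed = invariant_ratios((2.0, -1.0), (0.5, 3.0), lams, 0.4)  # det = 6.5, invertible
print(basic, mixed, expected)
```

Both the unmixed and the mixed pair reproduce the same four ratios, which is the invariance that the argument above relies on.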

Notes on the OU case and on 

Some more details of the OU case are now stated for completeness. The
ODE (6) is easily solved to give the cost-free value function as

; , . Gb /i 9 b\ x — a/b , , 

*w -jf+^I 4 ' ■*•?)■ ss ^r (26) 

Note that $z$ is the 'z-score', and $b/r$ is the price of a riskfree perpetual bond
paying a coupon of $b$. Also $I(x, \theta)$ is given by



l + r/6 rG' v 7V ' 7 l + r/6" 
The complementary functions $C_\pm(x)$ are given by

$$ C_\pm(x) = D^\pm_{r/b}(z), \qquad D^\pm_\nu(x) := \int_0^\infty t^{\nu - 1} e^{\pm x t - t^2/2}\, dt. \tag{27} $$

The functions $D^\pm_\nu$ are related to the parabolic cylinder functions, and here
are some of their properties, by which the numerical implementation of the
integral can be checked:

• $D^\pm_\nu(0) = 2^{\nu/2 - 1}\, \Gamma(\nu/2)$.

• ${D^\pm_\nu}'(0) = \pm 2^{(\nu - 1)/2}\, \Gamma\big(\tfrac{\nu + 1}{2}\big)$.

• $D^+_\nu(x) \sim \Gamma(\nu)/(-x)^\nu$ for $x \to -\infty$ (symmetrically for $D^-$).

• $D^+_\nu(x) \sim \sqrt{2\pi}\, x^{\nu - 1} e^{x^2/2}$ for $x \to +\infty$ (symmetrically for $D^-$).

• For small $\nu > 0$ we have $D^\pm_\nu(x) \approx \frac{1}{\nu} + \int_0^\infty (\ln t)(t \mp x)\, e^{\pm x t - t^2/2}\, dt$.

(NB: Our definition (27) is convenient, but nonstandard; see also pQ.) The
Wronskian is, with the chosen normalisation,

$$ W\{C_-, C_+\}(x) = \Gamma(r/b)/\phi(z), $$

with $z$ denoting the z-score as above, and $\phi$ the standard Normal pdf.

3 See e.g. I. Kaplansky, An Introduction to Differential Algebra, Hermann, Paris, 1957.
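The checks listed above can be exercised with a short quadrature routine. The following is a minimal sketch, assuming the integral representation $D^\pm_\nu(x) = \int_0^\infty t^{\nu-1} e^{\pm xt - t^2/2}\,dt$; it uses the substitution $t = u^2$ and composite Simpson's rule, is restricted to $\nu \ge 1/2$, and is most accurate when $2\nu - 1$ is a non-negative integer. The function names, truncation point and grid size are hypothetical choices, and the Wronskian is taken with respect to the z-score variable.

```python
import math


def d_pm(nu, x, sign=1, upper=8.0, n=4000):
    """Simpson estimate of D^sign_nu(x) = int_0^inf t^(nu-1) exp(sign*x*t - t^2/2) dt.

    Substituting t = u^2 gives the integrand 2*u^(2*nu-1)*exp(sign*x*u^2 - u^4/2),
    truncated at u = upper (adequate for moderate |x|). n must be even.
    """
    if nu < 0.5 or n % 2:
        raise ValueError("sketch assumes nu >= 0.5 and an even number of panels")

    def integrand(u):
        return 2.0 * u ** (2.0 * nu - 1.0) * math.exp(sign * x * u * u - 0.5 * u ** 4)

    step = upper / n
    total = integrand(0.0) + integrand(upper)
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * integrand(k * step)
    return total * step / 3.0


def wronskian(nu, x):
    """W{D^-, D^+}(x) = D^- (D^+)' - (D^-)' D^+, using (D^±_nu)' = ±D^±_{nu+1}."""
    return (d_pm(nu, x, -1) * d_pm(nu + 1.0, x, +1)
            + d_pm(nu + 1.0, x, -1) * d_pm(nu, x, +1))


nu, x = 1.5, 0.7  # hypothetical test values
value_at_zero = d_pm(nu, 0.0)
slope_at_zero = d_pm(nu + 1.0, 0.0)   # derivative of D^+ at 0
w_numeric = wronskian(nu, x)
w_closed = math.gamma(nu) * math.sqrt(2.0 * math.pi) * math.exp(0.5 * x * x)
print(value_at_zero, slope_at_zero, w_numeric, w_closed)
```

Here `w_closed` is $\Gamma(\nu)/\phi(x)$ written out explicitly, so agreement between `w_numeric` and `w_closed` tests the first two listed properties and the Wronskian at the same time.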